Picture for Benjamin Van Durme

Benjamin Van Durme

Johns Hopkins University

Conformal Thinking: Risk Control for Reasoning on a Compute Budget

Add code
Feb 03, 2026
Viaarxiv icon

RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval

Add code
Feb 03, 2026
Viaarxiv icon

HLTCOE Evaluation Team at TREC 2025: VQA Track

Add code
Dec 08, 2025
Viaarxiv icon

All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations

Add code
Oct 08, 2025
Figure 1 for All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Figure 2 for All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Figure 3 for All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Figure 4 for All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Viaarxiv icon

mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

Add code
Sep 08, 2025
Viaarxiv icon

Enabling Equitable Access to Trustworthy Financial Reasoning

Add code
Aug 28, 2025
Viaarxiv icon

MegaWika 2: A More Comprehensive Multilingual Collection of Articles and their Sources

Add code
Aug 05, 2025
Viaarxiv icon

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Add code
Jul 15, 2025
Viaarxiv icon

How Grounded is Wikipedia? A Study on Structured Evidential Support

Add code
Jun 14, 2025
Viaarxiv icon

Jailbreak Distillation: Renewable Safety Benchmarking

Add code
May 28, 2025
Viaarxiv icon